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Non-structural protein 9 (Nsp9) of coronaviruses is believed to bind single- 
stranded RNA in the viral replication complex. The crystal structure of 
Nsp9 of human coronavirus (HCoV) 229E reveals a novel disulfide-linked 
homodimer, which is very different from the previously reported Nsp9 
dimer of SARS coronavirus. In contrast, the structure of the Cys69Ala 
mutant of HCoV-229E Nsp9 shows the same dimer organization as the 
SARS-CoV protein. In the crystal, the wild-type HCoV-229E protein forms a 
trimer of dimers, whereas the mutant and SARS-CoV Nsp9 are organized in 
rod-like polymers. Chemical cross-linking suggests similar modes of aggre- 
gation in solution. In zone-interference gel electrophoresis assays and sur- 
face plasmon resonance experiments, the HCoV-229E wild-type protein 
is found to bind oligonucleotides with relatively high affinity, whereas 
binding by the Cys69Ala and Cys69Ser mutants is observed only for the 
longest oligonucleotides. The corresponding mutations in SARS-CoV Nsp9 
do not hamper nucleic acid binding. From the crystal structures, a model for 
single-stranded RNA binding by Nsp9 is deduced. We propose that both 
forms of the Nsp9 dimer are biologically relevant; the occurrence of the 
disulfide-bonded form may be correlated with oxidative stress induced in 
the host cell by the viral infection. 

© 2008 Elsevier Ltd. All rights reserved. 
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Introduction 


family of viruses has increased dramatically.” Coro- 
naviruses are enveloped, positive-strand RNA 


Since a coronavirus was identified as the causative 
agent of the 2003 outbreak of severe acute respira- 
tory syndrome (SARS),'™ scientific interest in this 
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coronavirus 229E; Nsp, non-structural protein; MPD, 
2-methyl1-2,4-pentanediol; DLS, dynamic light-scattering; 
MP°, main proteinase; r.m.s., root-mean-square; SSB, 
single-stranded DNA-binding protein; SPR, surface 
plasmon resonance; ssDNA, single-stranded DNA; 
ssRNA, single-stranded RNA; OB, oligonucleotide/ 
oligosaccharide-binding; ppla, polyprotein 1a; RU, 
resonance units. 


viruses that cause a wide spectrum of disease in 
humans and animals. These viruses are divided 
into three distinct groups on the basis of genome 
organization and phylogenetic analysis. Human 
coronavirus 229E (HCoV 229E) causes a mild form 
of the common cold and belongs to group 1, which 
includes the recently discovered human coronavirus 
NL63° and the porcine coronavirus, transmissible 
gastroenteritis virus (TGEV). Human coronaviruses 
belonging to group 2 are OC43 and HKU1, the latter 
also having been discovered very recently.’ The 
SARS coronavirus has been classified as an outlier of 
group 2.° Coronaviruses infecting birds have been 
identified as a separate group (group 3).’ 

The genome of HCoV 229E consists of 27,277 
nucleotides, comprising a total of eight open reading 
frames. The entire replicase complex of the virus is 
encoded within two large overlapping open reading 
frames, ORF 1a and ORF 1b. ORF 1a codes for 
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polyprotein la (ppla) with a calculated molecular 
mass of 454 kDa. Involving a (-1) ribosomal frame- 
shift, translation of ORF 1a and ORF 1b together 
yields the giant polypeptide lab (pplab) with a 
calculated molecular mass of 754 kDa.'”"! These 
polyproteins are processed by two virus-encoded 
papain-like proteases (PL1?"° and PL2?"°)’? and the 
main proteinase (MP", also called 3C-like protease, 
3CL?"*),’° resulting in 16 non-structural proteins 
(Nsps). The crystal structure of HCoV-229E MP"° has 
been determined by our group and shown to be 
similar to that of the homologous enzyme from 
TGEV.'° The structure of the SARS-CoV MPre!®17 
is also very similar. 

The C-terminal region of ppla comprises a set of 
relatively small polypeptide domains, Nsp6—Nsp11. 
In preliminary experiments, we have shown that 
Nsp10 from mouse hepatitis (corona)virus (MHV) 
is a double-stranded RNA-binding zinc-finger pro- 
tein,'® and that HCoV-229E Nsp8 and Nsp9 interact 
with nucleic acids.'” Also, it has been proposed 
recently that Nsp9 might interact specifically with 
the stem-loop II motif (s2m), a well defined RNA 
secondary-structure element at the 3’ end of many 
coronavirus genomes.” However, s2m does not 
seem to be conserved in HCoV 229E}. Nsp8 of 
SARS-CoV has the function of an RNA primase;~' its 
8:8 complex with Nsp7 has a three-dimensional 
structure reminiscent of the 82 “sliding clamp” of 
bacterial DNA polymerase, with a central channel 
suitable for double-stranded RNA binding.” It 
has been shown by analytical ultracentrifugation 
that Nsp8 also interacts with Nsp9,** although 
according to our own measurements using surface 
plasmon resonance, this interaction is either absent 
or very weak (R.P., unpublished results). Colocaliza- 
tion of Nsp7, Nsp8, Nsp9, and Nsp10 was observed 
in MHV.** Very likely, these non-structural pro- 
teins are involved directly in the replication complex 
built around the RNA-dependent RNA polymerase 
(Nsp12). 

Here, we describe the crystal structures of wild- 
type HCoV-229E Nsp9 at 1.75 A resolution and its 
Cys69Ala mutant at 1.80 A. In spite of 45% sequence 
identity between SARS-CoV and HCoV-229E Nsp9, 
the wild-type structure of the latter exhibits a mode 
of homodimerization that is entirely different from 
what has been observed in the crystal structure of 
the former.” To probe the effect of the observed 
intermolecular disulfide bridge on the formation of 
the HCoV-229E Nsp9 dimer, Cys69 was mutated to 
alanine. The crystal structure of this Nsp9 mutant 
shows a dimerization mode similar to that observed 
in SARS-CoV Nsp9.**”” However, gel mobility-shift 
assays and surface plasmon resonance (SPR) mea- 
surements indicate that only the wild-type HCoV- 
229E Nsp9, not the Cys69Ala mutant, binds 
strongly to single-stranded RNA and single- 
stranded DNA. In order to assess a possible direct 
role of Cys69 in nucleic acid binding, this residue 
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was also replaced by serine. Again, the mutant 
showed little or no affinity to single-stranded DNA 
(ssDNA). Finally, the corresponding residue (Cys73) 
of SARS-CoV Nsp9, which did not form a disulfide 
bond, was replaced by alanine and serine. Both 
mutants showed wild-type affinity to single- 
stranded oligonucleotides. It is therefore concluded 
that Nsp9 of HCoV 229E is substantially different 
from its orthologue in SARS-CoV. 


Results 


Structure elucidation and quality of the 
structural models 


Wild-type HCoV-229E Nsp9 and its Cys69Ala and 
Cys69Ser mutants were cloned with a His, tag 
connected to the N terminus of the protein via the 
linker sequence VKLQ. The latter tetrapeptide cor- 
responds to the C terminus of SARS-coronavirus 
Nsp8 (as well as HCoV-229E Nsp8) and therefore 
introduces a cleavage site for the main proteinase 
(MP"°) of SARS-CoV. After purification of the His¢- 
tagged protein using Ni-NTA chromatography, 
cleavage with the MP" yielded Nsp9 with an 
authentic N terminus. The wild-type Nsp9 was 
crystallized using a reservoir containing 1.8-2.1 M 
ammonium sulfate, 0.1 M sodium acetate pH 4.0- 
4.5, and 5% (v/v) 2-methy]l-2,4-pentanediol (MPD). 
Crystals were of space group P622, with a monomer 
in the asymmetric unit (Table 1). The structure was 
determined by molecular replacement, using a 
monomer of the SARS-CoV Nsp9*° as the search 
model, and refined to 1.75 A resolution. Residues 1— 
7 and 33-36 could not be modeled due to lack of 
electron density. Alternate conformations were 
detected in the electron density for the side-chains 
of Met9 and Lys82. The final R-factor for the struc- 
tural model is 19.0% and the Rgee is 22.4%; 97.2% of 
the amino acid residues are in the most-favored 
regions of the Ramachandran plot and the remain- 
der in the additionally allowed regions.”° 

The Cys69Ala mutant of HCoV-229E Nsp9 was 
prepared by single-site PCR mutagenesis from the 
wild-type plasmid. Preparation of the protein was 
identical with wild-type Nsp9. The conditions iden- 
tified for crystallization of the wild-type Nsp9 failed 
to yield crystals of the mutant. Instead, the following 
crystallization conditions were established: 0.2 M 
ammonium sulfate, 0.2 M sodium acetate (pH 4.6), 
30% (w/v) polyethylene glycol monomethy! ether 
(PEG-MME) 2000. The crystals displayed space 
group P2,2,2;, with a dimer of the Nsp9 mutant 
per asymmetric unit (Table 1). Residues 1, 2 and 109 
of monomer A have not been modeled due to lack of 
electron density; the same is true for residues 14 
and 107-109 of monomer B. The segment compris- 
ing residues 53-56 could be built into electron 
density but proved to be very flexible. The structure 
was refined to a resolution of 1.80 A, with R=22.1% 
and Rfree= 28.1% (see Supplementary Data Fig. S1). 
Of the amino acid residues in the structural model, 
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Table 1. Data collection and refinement statistics 


Nsp9 wild type | Nsp9 mutant 


A. Data collection 


Wavelength (A) 0.8075 0.8075 
Resolution (A) 40.0-1.75 30.0-1.80 
(1.79-1.75) (1.86-1.80) 

Space group P622 P24242, 
Unit-cell parameters 

a (A) 85.63 26.40 

b (A) 85.63 61.38 

c (A) 48.69 107.31 
Solvent content (%, v/v) 42.3 S15 
Overall reflections 129,656 139,726 
Unique reflections 11,317 (730) 16,842 (1648) 
Multiplicity 11.5 (11.5) 8.3 (4.7) 
Completeness (%) 99.9 (100.0) 99.4 (99.7) 
Rmerge’ (%) 8.3 (60.3) 8.9 (35.1) 
T/o(D) 13.7 (4.45) 19.7 (3.96) 
B. Refinement 
Resolution (A) 40.0-1.75 30.0-1.80 
Rewer 0.190 0.221 
Reree? 0.224 0.281 
rm.s.d. from ideal geometry 

Bond lengths (A) 0.013 0.017 

Bond angles (°) 1.417 1.962 
Protein atoms 778 1604 
Solvent atoms 65 74 
MPD 1 - 
DIT - al 
Sulfate 2 - 
Ramachandran plot regions 

Most favoured (%) 97.2 92.5 

Additionally allowed (%) 2.8 7.5 

Generously allowed (%) 0 0 

Disallowed (%) 0 0 


Values in parentheses are for the highest resolution shell. 

° Rmerge = Donkt dil (hkl) —(k1)) | / Yaka U(hkl);, where I(hkl) is 
the intensity of reflection hkl and (I(hkl)) is the average intensity 
over all equivalent reflections. 

. Reyst= Dank |Fo(tkl)—F Chk] | / SniFo(hkl). Reee was calculated 
for a test set of reflections (5%) omitted from the refinement. 


92.5% are in the most favored regions of the 
Ramachandran plot and the remainder are in the 
additionally allowed regions. 


Overall structure of the Nsp9 monomer 


Crystals of wild-type HCoV-229E Nsp9 contain 
one monomer per asymmetric unit, which forms a 
homodimer due to the crystallographic twofold axis 
(see below). The fold of the monomer is related to 
the oligonucleotide/oligosaccharide-binding mod- 
ules (OB fold). This fold is characteristic of proteins 
binding to single-stranded nucleic acids*’ and 
occurs, for example, in ssDNA-binding proteins 
from bacteria”® to man”? as well as in viruses.°° The 
canonical OB fold comprises five antiparallel (- 
strands that form a partial barrel, and an a-helix that 
packs against the bottom of the barrel, usually in an 
orientation along the long axis of the B-barrel cross- 
section.” In the classical OB fold, the a-helix is 
interspersed between B-strands 3 and 4, but in Nsp9 
the helix is appended at the C terminus of the 
polypeptide chain (residues 92-108). Also, Nsp9 has 
two extra f-strands (strands 6 and 7) forming a long 


hairpin (L67). Some of the loops connecting the B- 
strands, e.g. L12, L23, L45, and L67 (see Fig. 1), are 
very flexible. 

In the electron density map for wild-type HCoV- 
229E Nsp9, we unequivocally located an MPD 
molecule that fills a space between strand 82 and 
the C-terminal a-helix in the monomer, very much 
in agreement with the commonly observed binding 
pattern for this amphiphilic additive.*’ The hydro- 
xyl groups of the MPD make hydrogen bonds with 
Asn27 and a (half-occupied) sulfate ion, which is in 
turn interacting with one of two alternative side- 
chain conformers of Lys82 and the main-chain 
amide of Asn27. The hydrophobic side of the MPD 
interacts with Leu29 and packs against the helix 
near Vall06 (Supplementary Data Fig. $2). This 
observation explains nicely why 5% MPD was an 
essential additive in the crystallization of HCoV- 
229E wild-type Nsp9, in addition to the (NH4)2SOx. 

The structure of the monomer of the HCoV-229E 
Nsp9 Cys69Ala mutant displays an r.m.s. deviation 
of 0.71 A from the wild-type monomer (for 92 C* 
atoms of chain A of the mutant; the corresponding 
values for chain B are 87 C“ atoms and 0.67 A; see 
Fig. 1). In this calculation, residues 1-7, 33-36 (loop 
L23), and 107-109 have been omitted because of 
weak or non-visible electron density in one or both 
of the two structures. Larger than average devia- 
tions occur in loops L12 (residues 19-22) and L45 
(residues 55-60; the tip of this loop at residue Ser58 
deviates by 4.36 A and 2.89 A between wild-type 
and mutant molecules A and B, respectively). The r. 
m.s. deviation between monomers A and B of the 
Cys69Ala mutant is 0.96 A (for 99 C* atoms). 

In contrast to the wild-type Nsp9, MPD was not a 
useful additive in crystallization experiments with 
the mutant protein. However, dithiothreitol (DTT) 


L23 


Fig. 1. Superimposition of monomers. Ribbon repre- 
sentation of HCoV-229E Nsp9 wild-type (green) and 
Cys69Ala mutant (red) monomers, superimposed with a 
C* rm.s. deviation of 0.71 A. Loop L23 of wild-type 
HCoV-229E Nsp9 could not be built due to the lack of 
electron density. 
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was essential. Again, we located extra electron den- 
sity between Asn27 and the a-helix (this time near 
Tle102) and attributed this to a DTT molecule, al- 
though the assignment was not as unambiguous as 
was the identification of MPD ina nearby location in 
the wild-type protein. 


Comparison of the Nsp9 monomer with 
SARS-CoV Nsp9 


The HCoV-229E Nsp9 monomer is also very si- 
milar to the monomer of SARS-CoV Nsp9” (PDB 
code 1QZ8), with an r.m.s. deviation of 0.75 A for 84 
C* atoms of wild-type Nsp9 (for chain A of 1QZ8; 
the value for chain B is 0.66 A). The values for the 
mutual comparisons between the individual chains 
of the Cys69Ala mutant and those of 1QZ8 are 
between 0.76 A and 1.23 A. Interestingly, the other 
available crystal structure for SARS-CoV Nsp9*? 
(PDB code 1UW7) is significantly more distant in 
terms of rm.s. deviations, with 1.75 A for 94 C® 


atoms of the wild-type HCoV-229E Nsp9, and 1.39 A 
for 94 C“ atoms of the Cys69Ala mutant. 


Structure of the Nsp9 dimer 


Wild-type HCoV-229E Nsp9 forms a disulfide- 
linked homodimer, with the twofold crystallographic 
axis of symmetry running through the disulfide bond 
formed between the Cys69 residues of each monomer 
(Fig. 2a). The a-helix of each monomer is also 
involved in dimerization through formation of two 
hydrogen bonds between the Asn92 side-chain and 
the main-chain of residue 74 (f-strand 6), as well as 
one H-bond between the Asn92 side-chain amide and 
the C-terminal carboxylate (residue 109; Fig. 2b). 
Among the sequenced coronaviral Nsp9 proteins, this 
asparagine is present only in HCoV 229E and SARS- 
CoV; other coronaviruses have either Thr or Ser at this 
position. There is a fourth hydrogen bond donated by 
the N™ atom of totally conserved Arg95 (helix H1; see 
Supplementary Data Fig. S3 for sequence alignment) 


Fig. 2. Structural features of the homodimers of wild-type HCoV-229E Nsp9 and the Cys69Ala mutant; the two 
monomers are colored red and green, respectively; the disulfide, where present, is shown in yellow. N and C denote the 
amino and carboxy termini, respectively, of the polypeptide chains. (a) Ribbon representation of the disulfide-linked wild- 
type HCoV-229E Nsp9 dimer. (b) Residues involved in the dimer interface of wild-type Nsp9 (sticks; red, oxygen; blue, 
nitrogen; yellow, carbon). Intermolecular hydrogen bonds are indicated by broken lines. (c) HCoV-229E Cys69Ala mutant 
dimer. (d) Residues involved in the dimer interface of the mutant Nsp9. Color code is the same as in b. The closest 
approach between the C-terminal a-helices, between the C* atoms of Gly A100 and Ala B97, is indicated by a dotted line. 
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to the main-chain carbonyl of residue 72 (B-strand 6). 
Because of the crystallographic twofold symmetry, all 
of these interactions are duplicated in the dimer, so 
that there are eight intersubunit hydrogen bonds 
(Fig. 2b). In spite of the presence of these favorable 
interactions, there seem to be a number of less ideal 
interactions. For example, Arg70 does not have an 
interaction partner proper; its side-chain makes 
contacts with Pro67 and Pro68. Its intersubunit 
contacts are in fact determined by the nearby disulfide 
bond (Cys A69-Cys B69). There is a weak, but 
favorable interaction between the side-chains of Phe 
A71 and Phe B86. The two a-helices (residues 92-108) 
that are part of the interface are in an antiparallel 
orientation (describing an angle of 167°, Fig. 2a) witha 
close approach of 3.77 A between the C* atoms of Gly 
A96 and Gly B103. In between these two glycines of 
the a-helix, there is a third one, Gly100, which also 
facilitates the close approach of the two helices (the 
distance to its symmetry mate is 4.13 A; see Supple- 
mentary Data Fig. 54). Gly96 and Gly100 are com- 
pletely conserved among the coronaviruses, and 
Gly103 is present in group 1 coronaviruses (Supple- 
mentary Data Fig. S3). The surface area per Nsp9 
monomer buried through dimer formation” is 
985 A’. 

Although the residue responsible for disulfide 
formation in HCoV-229E Nsp9, Cys69, is conserved 
in SARS-CoV Nsp9, and the sequence identity is as 
high as 45% between the two proteins (see Supple- 
mentary Data Fig. 53), the mode of dimerization in 
the latter is very different from what we observe in 
our structure. A disulfide bond is not formed, and 
the dimerization interface involves mainly the a- 
helix of each subunit, but in a parallel rather than 
anti-parallel orientation.*’” In contrast to HCoV- 
229E Nsp9, which we had prepared with authentic N 
and C termini, the SARS-CoV protein used by Sutton 
et al.”* for crystallization carried 30 additional amino 
acid residues at the N terminus, due to the cloning 
procedure. From the structure,” it is evident that the 
additional N-terminal segment leads to formation of 
a B-hairpin involving residues —7 to 8, as well as an 
intermolecular salt bridge between GluA(-7) and 
ArgB111. This additional interaction might favor the 
dimerization mode seen in the Sutton et al. structure. 
However, the structure published by Egloff et al.° 
for SARS-CoV Nsp9 was derived from a protein that 
carried only six additional histidine residues at its N 
terminus (B. Canard, personal communication). 
These residues were not seen in the electron density 
maps, presumably due to disorder, and certainly are 
not involved in intersubunit interactions. Yet, this 
structure still features a mode of dimerization that is 
highly similar to that described by Sutton et al.*° and 
completely different from that observed by us for 
HCoV-229E Nsp9. 

Disulfide bonds are rare in proteins in the cytosol, 
where the environment is of reducing character, and 
we therefore have to discuss the possibility that the 
dimerization mode seen in our structure of HCoV- 
229E Nsp9 is an artifact of disulfide bond formation, 
in spite of the presence of 5 mM DIT throughout 


protein preparation and crystallization (higher con- 
centrations of DTT prevented crystallization of the 
protein). In order to assess the role of the disulfide 
bond in dimer formation, we replaced Cys69 in 
HCoV-229E Nsp9 by alanine. The structure of the 
mutated protein (Fig. 2c) revealed a dimer that is 
grossly different from that of wild-type HCoV-229E 
Nsp9: When superimposing monomer A of the mu- 
tant structure onto the same monomer of the wild- 
type protein, the centroid of monomer B deviates 
from its position in the wild-type protein by 23.5 A, 
and the angle of rotation between the two positions 
of monomer B is 72°. However, this dimerization 
mode is identical with that of SARS-CoV Nsp9 
(cf. Fig. 2c and Supplementary Data Fig. 55). The 
dimer of the HCoV-229E Cys69Ala mutant can be 
superimposed onto the dimer of the SARS-CoV 
Nsp9 protein (1QZ8~) with an r.m.s. deviation of 
0.99 A for 175 C* atom pairs (see Supplementary 
Data Fig. S5). The r.m.s. deviation is much higher 
(2.7 A for 191 C* atom pairs) for the SARS-CoV Nsp9 
structure described by Sutton et al.”° (1UW7); this is 
very likely due to the disturbances of the latter struc- 
ture by the N-terminal tag residues. In the HCoV- 
229E Nsp9 mutant, dimerization appears to rely ona 
few interactions only. There is no single proper 
hydrogen bond between the two monomers, and 
only a few hydrophobic contacts mediate the 
interaction (Fig. 2d). The immediate N terminus is 
disordered, but residues 3-6 and 5-7 of the A and B 
chain, respectively, lie over the other monomer and 
interact weakly with conserved Phe71 (strand 6) 
and with the C-terminal a-helix near residues Leu99 
and Gly103. The majority of the interactions between 
the two monomers is made by the two helices, one 
from each monomer, that run largely in parallel in 
this dimer, crossing at an angle of 48° and a closest 
approach of 3.96 A (C°-C%) between Ala97 and 
Gly100 (Fig. 2d). But again, the hydrophobic contacts 
between the helices are only weak. Importantly, 
neither of the two helices deviates from its ideal 
geometry, including the intrahelical hydrogen 
bonds, for the benefit of the intermolecular contacts. 
The surface area buried upon dimer formation” is 
687 A? for the mutant protein. This value is sig- 
nificantly lower than that observed for the dimeriza- 
tion mode seen for the wild-type HCoV-229E Nsp9 
(see above). Even if a few more intermolecular 
interactions were made by the disordered N-term- 
inal residues not seen in the electron density map 
(residues 1 and 2 of chain A and 1-4 of chain B), this 
value would not increase dramatically, and such 
interactions would very likely not be strong (other- 
wise these residues would not be disordered). In 
summary, the monomer—monomer interface of the 
Cys69Ala mutant of HCoV-229E Nsp9 is far from 
ideal and appears to be much weaker than the 
interface seen in the wild-type protein. 


Higher oligomers in the crystal of wild-type Nsp9 


In the crystal structure of wild-type HCoV-229E 
Nsp9, three dimers are arranged to form a hexamer 
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with 32 symmetry (Fig. 3). There are multiple 
interactions between monomers across the hex- 
amer, which we discuss briefly here, according to 
the color scheme used in the figure. At the center of 
the hexamer, there are two disordered sulfate ions 
sitting on the crystallographic threefold axis, 19.0A 
apart. They make ionic interactions (2.58 A from 
the closest sulfate oxygen) with the Lys50 side- 
chains (N* atom) of the red, green and blue mono- 
mers (upper layer of the hexamer) and the yellow, 
cyan, and magenta monomers (lower layer), res- 
pectively. Also, Asp19 of each monomer forms a 
2.90 A intermolecular hydrogen bond with the 
Lys50 residue (e.g., red-green), as well as a 4.76 A 
intramolecular salt bridge with Lys50 of the same 
monomer (e.g., red-red). As a result, we find a ring 
of ionic interactions formed by Asp19 (loop L12) 
and Lys50 (L34) residues along both the upper and 
the lower rim of the central cylinder inside the 
trimer of dimers (Fig. 3). Asp19 is present in most 
coronavirus Nsp9 sequences (not in SARS-CoV and 
IBV), whereas Lys50 is highly conserved across the 
family (see Supplementary Data Fig. 3). The cy- 
lindrical hole running along the threefold axis is 
only about 4.4 A wide, i.e. large enough for sul- 
fate ions, but too small to accommodate single- 
stranded nucleic acid at the center of the hexamer 
(Fig. 3). 

Additional intermolecular interactions within the 
Nsp9 hexamer are listed in Supplementary Data 
Table S1. Through the sixfold axis of the crystal, 
hexamers are arranged into 36-mers (Supplemen- 
tary Data Fig. S6). 


Nsp9 polymers in the crystal of the Cys69Ala 
mutant 


In the crystal structure of the Cys69Ala mutant dimer 
of HCoV-229E Nsp9, a second protein-protein interface 
(termed M2; Supplementary Data Table S1) is formed 
through the close approach of strands £5 of neighbor- 
ing molecules, although only one hydrogen bond is 
formed (Ser58 O...Glu64 N, 2.62 A). In addition, there 
are two hydrogen bonds donated by the guanidinium 
group of Arg94 (in the helix) to the carbonyl oxygen of 
Gly34 (L23; 2.99 and 3.14 A). Furthermore, there is yet 
another H-bond between the side-chains of Asn89 
(loop L7H) and Asp57 (L45; 2.99 A). The surface area 
buried by formation of this dimer is 450 A*. Together 
with the monomer-monomer interface M1 described 
above, M2 leads to the polymerization of the protein 
(see Discussion, Fig. 7c). Yet another protein-protein 
interface (M3) with quasi-twofold symmetry, but not 
involved in polymer formation, is mentioned in 
Supplementary Data Table S1. 


Oligomeric state in solution 


In order to determine the oligomeric state of wild- 
type and mutant HCoV-229E Nsp9 in solution, we 
applied a number of biophysical and biochemical 
techniques. For both wild-type HCoV-229E Nsp9 
(fresh preparation) and the Cys69Ala mutant, 
Dynamic Light-Scattering (DLS) revealed a mono- 
disperse peak centered at a hydrodynamic radius of 
28+1.4 A, indicating that the homodimer is the pre- 
valent species in solution. A similar result was ob- 


Fig. 3. Ribbon representation of 
the wild-type HCoV-229E Nsp9 
hexamer. Three dimers of the pro- 
tein form a hexamer through the 32 
axis of symmetry. The threefold axis 
is at the center of the hexamer. Nsp9 
monomers in the upper layer are 
colored red, blue and green, and 
those in the lower layer are colored 
yellow, cyan and magenta. The two 
sulfate ions on the threefold axis are 
indicated in the same colors. Each 
sulfate is three-fold disordered. 
The twofold axes run between the 
monomers. 
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tained by analytical gel-filtration, which showed a 
single peak corresponding to a molecular mass of 
~26 kDa for both wild-type and mutant (data not 
shown). In agreement with the crystal structure, the 
DLS experiment revealed the presence of higher 
oligomers upon addition of small amounts of sulfate 
ions (up to 9.7 mM; data not shown). 

Glutaraldehyde crosslinking was done for HCoV- 
229E Nsp9 wild-type, the Cys69Ala mutant, and 
SARS-CoV Nsp9. Glutaraldehyde (0.01%, v/v) was 
used with different concentrations of protein ran- 
ging from 10 1M to 100 uM. The HCoV-229E Nsp9 
Cys69Ala mutant and SARS-CoV wild-type Nsp9 
showed similar crosslinking products correspond- 
ing to monomers, dimers, trimers, tetramers, and 
higher oligomers (Fig. 4). In HCoV-229E Nsp9, the 
wild-type showed only monomers, dimers, and 
trimers. The increasing presence of higher-molecular 
mass species correlated with increasing protein 
concentration. This pattern did not change in the 
presence of 36-mer or 51-mer ssDNA of random 
sequence (not shown). 


Oxidation state of Cys69 in solution 


By titration of free sulfhydryl groups with Ellman’s 
reagent,*’ we found that wild-type HCoV-229E Nsp9 
has no free cysteine in solution; i.e. the disulfide bond 
exists in solution as well. However, in a more recent 
preparation of the wild-type protein, reaction with 
Ellman’s reagent immediately after purification of 
wild-type Nsp9 did indicate the presence of one free 
cysteine per mole of protein. Crystallization of this 
sample yielded crystals overnight that were of 
the same habit as the original crystals obtained 
for the wild-type protein, with unit cell parameters 
a=b=85.4 A and c=48.8 A in space group P622. 
When we investigated this phenomenon further, we 
observed that the formation of the disulfide bond 
seemed to depend on the age of the protein 
preparation. It is possible that in the presence of 
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oxygen, gradual oxidation of the protein (probably 
correlated with the oxidation of DTT) may lead to 
formation of the disulfide bond, resulting in the di- 
merization mode visualized by X-ray crystallogra- 
phy. The concentration of DTT required to reduce 
the disulfide bond completely was determined as 
10 mM by SDS gel electrophoresis (see Materials and 
Methods). With concentrations of DTT up to 4 mM, 
the dimer was the dominant species visible on the 
gel, whereas above, the monomer was more 
pronounced. The dimer band vanished completely 
at 10 mM DTT. This result was the same indepen- 
dent of the presence or the absence of a heating step 
(95 °C for 5 min). 

In contrast to HCoV-229E Nsp9, we could show 
by using Ellman’s reagent that SARS-CoV Nsp9, 
which has three cysteine residues and which we had 
prepared the same way as its HCoV-229E ortholo- 
gue (i.e., with authentic N and C termini), had three 
free sulfhydryl groups per mole in solution even 
after several weeks of storage. 


Binding of nucleic acids 


Gel mobility-shift assay 


Using a gel mobility-shift assay (a modified ver- 
sion'® of zone-interference gel electrophoresis, 
ZIGE™*; Fig. 5), we found that wild-type HCoV- 
229E Nsp9 bound to single-stranded oligodeoxy- 
nucleotides (6-mers to 50-mers; Fig. 5a, lanes 3-10) 
and, to a very limited extent (if at all), to a double- 
stranded oligodeoxynucleotide (24-mer; Fig. 5a, 
lane 2). Nsp9 of SARS-CoV also bound to both 
single-stranded and, again very weakly, to double- 
stranded oligonucleotides (Fig. 5b). However, the 
effect on the gel mobility shift did not increase 
smoothly with oligonucleotide length; rather, there 
was a stepwise increase from the 13-mer (Fig. 5b, 
lane 3; no shift) via the 18-mer and 24-mer (lanes 4 
and 5) and the 30--45-mers (lanes 6-9) to the 50-mer 
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Fig. 4. Nsp9 crosslinking using glutaraldehyde. Crosslinking was carried out with different concentrations of protein 
(10-100 1M) using 0.01% (v/v) glutaraldehyde. The molecular mass of the cross-linking products is indicated. Wild-type 
SARS-CoV Nsp9 and the HCoV-229E Nsp9 Cys69Ala mutant form higher oligomers at a protein concentration of 100 4M, 
presumably involving interactions similar to those seen in the crystal structure. In contrast, wild-type HCoV-229E Nsp9 


does not form oligomers higher than trimers. 
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Fig. 5. Gel mobility-shift assay 
(zone-interference gel electropho- 
resis, 1% agarose; see Materials 
and Methods) probing oligonucleo- 
tide binding to Nsp9. (a) Wild-type 
HCoV-229E Nsp9; (b) wild-type 
SARS-CoV Nsp9. Lane 1, protein 
without ssDNA; lane 2, 24-mer 
dsDNA; lanes 3-10, various lengths 
of ssDNA from 6-mer to 50-mer. 
Wild-type HCoV-229E Nsp9 dis- 
plays a linear increase of the shift 
with increasing length of ssDNA, 
whereas the increase is step-wise 
for SARS-CoV Nsp9. (c) Gel mobi- 
lity-shift analysis for mutant pro- 
teins, compared to the correspond- 
ing wild-type. Lanes 1, protein 
without ssDNA; lanes 2, 24-mer, 
and lanes 3, 55-mer ssDNA with 
protein. The HCoV-229E Nsp9 
Cys69Ala mutant (HAM) and the 


SSM 


23 1 2 3 


Cys69Ser mutant (HSM) do not show any shift with the 24-mer (lane 2) and only a small shift with the 55-mer (lane 3), 
whereas the SARS-CoV Nsp9 Cys73Ala mutant (SAM) and the Cys73Ser mutant (SSM) exhibit shifts with the 55-mer 
oligonucleotide that are similar to wild-type SARS-CoV Nsp9. The upper bands (gray) in lanes 3 for HAM and HSM 
correspond to precipitated, unbound 55-mer oligonucleotide (not stained by Coomassie brilliant blue; see Materials and 


Methods). 


(lane 10). In contrast, nucleic acid binding by the 
HCoV-229E Cys69Ala mutant was not detectable 
with this method, except for a very weak shift with 
the 55-mer (Fig. 5c, lane HAM 3). As the apparent 
inability of the Cys69Ala mutant to bind nucleic 
acids could depend on the lack of a direct (hydrogen 
bonding) interaction between the cysteine and the 
oligonucleotides, we also replaced the cysteine by 
serine. The Cys69Ser mutant did not bind short 
oligonucleotides either, but did show some gel shift 
with the 55-mer (Fig. 5c, lane HSM 3). Next, we 
replaced the corresponding cysteine in SARS-CoV 
Nsp9 by alanine and serine. Both the Cys73Ala and 
Cys73Ser mutants displayed a similar shift in the 
presence of the 55-mer oligodeoxynucleotide as the 
wild-type protein (Fig. 5c, lanes SAM 3 and SSM 3). 
Reduction (by 50 mM DTT) or oxidation (by 17.5% 
HO.) of the disulfide-containing wild-type HCoV- 
229E Nsp9 did not change the gel mobility-shift 
pattern of the protein in the presence of nucleic acids 
(not shown). 


Surface plasmon resonance 


Nsp9 binding to ssDNA was analyzed using SPR 
experiments. For this, a 5/-biotinylated 50-mer 
oligonucleotide was immobilized on a streptavi- 
din-coated chip (SA chip). Freshly prepared Nsp9 
was treated with 5 mM DTT directly before 
injection. We observed a signal for Nsp9 interaction 
with the oligonucleotide when concentrations of 
protein were in the micromolar range (Supplemen- 
tary Data Fig. S7). Apparent Kp values for the wild- 
type Nsp9 from HCoV_229E and SARS- CoV were 
determined as 28 uM (x*=1.29) and 29 1M (x7=8.52 


for a single-state binding model), respectively. In 
fact, the SARS-CoV Nsp9 binding profile is better 
explained by a two-state binding model (x7=0.73). 
This is not true for HCoV-229E Nsp9, the binding 
profile of which agrees well with a single-state 
binding model. However, we could not use con- 
centrations of HCoV-229E Nsp9 greater than 35 uM 
(SARS-CoV Nsp9: 85 uM), because non-specific 
binding appeared to govern the profile above this 
value and saturation was not reached. We could also 
not derive Kp values for the oxidized form of wild- 
type HCoV-229E Nsp9 nor for the SARS-CoV and 
HCoV-229E Nsp9 mutants because of the same 
phenomenon. 

We compared the reduced and oxidized state (i.e. 
in the presence and in the absence of 5 mM DTT, 
respectively) of wild-type HCoV-229E Nsp9 with 
respect to binding the oligonucleotide in the SPR 
experiment. For this, the oligonucleotide was again 
immobilized onto the chip. Wild-type HCoV-229E 
Nsp9 (20 1M, freshly prepared (less than three days 
old), containing one free thiol group per mole) and 
the HCoV-229E Cys69Ala mutant (20 1M) were 
injected into the flow cell, with the constant 
presence of DTT in the running buffer. Also, 
20 uM aged preparation (more than two weeks 
old, no free cysteine) of wild-type HCoV-229E Nsp9 
was injected without DTT in the running buffer. 
Freshly prepared wild-type (in the presence of 
DTT) and Cys69Ala mutant protein showed similar 
binding curves with the oligonucleotide and gave a 
maximum response (Rmax) of 33 resonance units 
(RU). In contrast, the aged preparation of wild-type 
HCoV-229E Nsp9 displayed an Rmax of 83 RU, 
indicating much stronger binding to the nucleic 
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Fig. 6. HCoV-229E Nsp9 binding to ssDNA analyzed 
by surface plasmon resonance, in the presence and in the 
absence of DTT. A 5/-biotinylated 50-mer oligonucleotide 
was immobilized to an SA chip up to 88 RU. a and b, The 
binding curves for a 20 1M fresh preparation of the Nsp9 
Cys69Ala mutant and for wild-type Nsp9, respectively, 
both injected in the presence of 5 mM DTT. c, The binding 
curve for a 20 .M aged preparation of wild-type Nsp9, 
injected in the absence of 5 mM DTT. 


acid than that observed for the fresh preparation or 
the mutant (Fig. 6). 


Discussion 


In this study, we observed different dimerization 
modes for HCoV-229E Nsp9 by X-ray crystallogra- 
phy. The wild-type protein exhibits a homodimer 
that is very different from that seen previously for 
SARS-CoV Nsp9,”°”? in spite of a sequence identity 
of 45% between the two proteins. In HCoV-229E 
Nsp9, dimerization is mediated by a disulfide 
bridge, a few hydrogen bonds, and by hydrophobic 
interactions between the C-terminal helix of each 
monomer. One major difference between our pre- 
paration of HCoV-229E Nsp9 and that of SARS-CoV 
Nsp9 by both Egloff et al. ?” and Sutton et al. ** is that 
we have worked with a protein with authentic chain 
termini, whereas the SARS-CoV protein used by 
those authors has N-terminal extensions from the 
cloning procedure (a Hisg tag in the case of the Egloff 
et al. structure, and an extra 30 residues in the Sutton 
et al. structure). Interestingly, the differences of the 
N-terminal extensions between the two reported 
structures of SARS-CoV Nsp9 led to deviations in 
the dimer in detail, resulting in a rather high r.m.s. 
deviation of 2.07 A (for C“ atoms) between the two 
models. Residues —7 to —2 of the N-terminal tag 
present in the Sutton et al. structure form an extra 
antiparallel B-sheet with residues 3-8 of the protein, 
thereby pushing away the B6-B7 hairpin (L67; 
Supplementary Data Fig. 55) and causing the C- 
terminal part of the a-helix to kink. Regardless of 
whether the presence of the extra residues at the N 
terminus of the SARS-CoV Nsp9 preparations used 
for structure determination results in artifacts, the 
observation of a completely different, disulfide- 
linked dimer in HCoV-229E Nsp9 is remarkable. 


The occurrence of a disulfide bond in a viral pro- 
tein located in the cytoplasm of the infected cell is 
unexpected, because here the overall milieu is 
reductive and disulfide bonds are rare, although a 
few cytosolic proteins containing them have been 
described.*°** Therefore, we have to take into 
account the possibility that formation of the dis- 
ulfide is an artifact of the conditions of protein pre- 
paration. In order to probe the effect of the disulfide 
bridge, Cys69 of HCoV-229E Nsp9 was mutated to 
alanine. Surprisingly, the crystal structure of the 
mutant displays the same dimerization mode as 
SARS-CoV Nsp9 and is thus very different from the 
wild-type HCoV-229E dimer. 

We tried to assess the relevance of the two 
different dimers seen in our structures by calculating 
the surface area buried upon dimerization as well as 
the shape complementarity (Supplementary Data 
Table $1).°’ The shape complementarity of the 
monomer-monomer interface in the wild-type 
structure is as low as 0.56, but that of the mutant 
is not much better (0.63). For comparison, this value 
is 0.67 and 0.70 for the two crystal structures of 
SARS-CoV Nsp9,”°”? respectively, which show the 
same dimerization mode as the Cys69Ala mutant of 
HCoV-229E Nsp9 (the artificial N-terminal tag, 
which also makes intersubunit contacts, has been 
removed from the Sutton et al. structure (PDB code 
1UW7) in this calculation). 

Both wild-type and mutant dimer contacts are 
mediated mainly by the C-terminal helix of one 
monomer interacting with its (quasi-)symmetry 
mate in the other. However, the helices pack against 
one another in different orientations. The amino acid 
residues of the helix that are involved in the inter- 
action are highly conserved and small (GX3GX2GA), 
allowing helix packing according to the ridges- 
into-grooves model (Supplementary Data Fig. $4).*° 
The GXXXG motif is actually a common feature of 
the association of transmembrane helices.*”*° In 
wild-type HCoV-229E Nsp9, residues 1, 4 and 7 of 
the helix sequence given above are involved in the 
interaction, allowing a close approach of the helices 
in an antiparallel orientation with an angle of 167°. 
In contrast, in the mutant structure, residues 1, 4, 
and 8 are involved in the stabilization of a parallel 
orientation with a crossing angle of 48°. The fact that 
the amino acid sequence of the C-terminal helix of 
Nsp9 allows stabilization of both forms of the dimer 
may support the idea that both forms are indeed 
biologically relevant (see below). 

In addition to the dimerization modes that we 
identified in the “parent” dimer of the wild-type 
protein and the Cys69Ala mutant, we have to 
consider a number of additional protein-protein 
interfaces that are seen in the crystal structures (see 
Supplementary Data Table S1). In the wild-type 
protein, three disulfide-bonded dimers form a 
trimer of dimers, or hexamer, involving interfaces 
W2 and W3. Hexamers are assembled into 36-mers 
through interface W4 (Supplementary Data Fig. S6). 

In the case of the Cys69Ala mutant, there are two 
other monomer—monomer interfaces in the crystal, 
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in addition to the “parent mode” (M1). M3 is formed 
by the 86-7 hairpin and involves mainly hydro- 
phobic interactions between side-chains that are not 
conserved (Supplementary Data Table S1), although 
we note that a similar interface exists in the Egloff 
et al. ~ structure of SARS-CoV Nsp9. The other 
interface, M2, arises through some limited interac- 
tion between strands (5 of neighboring molecules in 
the crystal. M2 has four hydrogen bonds between 
main-chain atoms and is reminiscent of intersubunit 
interactions involving P-strands in the ssDNA- 
binding protein (SSB) from Escherichia coli, a proto- 
type OB-fold protein.** This same alternative dimer- 
ization mode was discussed by Sutton et al. ** for 
their structure of the SARS-CoV Nsp9, but was 
considered irrelevant. However, this interface is 
found also in the SARS-CoV Nsp9 structure 
described by Egloff et al. * even though the space 
group of these crystals is different. In summary, both 
dimerization modes M1 and M2 occur in all crystal 
structures of Nsp9 described so far (except that of 
the HCoV-229E wild-type protein), even in a second 
crystal form mentioned briefly by Sutton et al. * (for 
which no data have been deposited in the Protein 
Data Bank), i.e. in a total of four different crystalline 
environments. We therefore propose that Nsp9 
oligomerization, in particular in the presence of 
ssRNA, is mediated through these two protein— 
protein interaction surfaces. The existence of such 
oligomers in solution is supported by our glutar- 
aldehyde crosslinking experiments, which revealed 
the presence of monomers, dimers, trimers, and 
higher oligomers for wild-type SARS-CoV Nsp9 and 
HCoV-229E Cys69Ala Nsp9 in SDS-PAGE under 
reducing conditions (Fig. 4). Interestingly, for the 


(a) , (b) , 


wild-type 229E protein, only monomers, dimers, 
and, to a limited extent, trimers were seen by this 
method. This is consistent with the observed crystal 
structures: when in the disulfide-linked state, wild- 
type 229E Nsp9 cannot normally form oligomers 
larger than hexamers (Fig. 3), whereas the Cys69Ala 
mutant as well as SARS-CoV Nsp9 can form poly- 
mers (Fig. 7). 

We used the sulfate ions present in one of the 
SARS-CoV Nsp9 structures*> (PDB ID 1QZ8) to 
propose a binding mode for single-stranded RNA 
(ssRNA) (Fig. 7a). In this crystal structure, three 
sulfate ions are located near one of the two mono- 
mers, two of them in the vicinity of the completely 
conserved lysine residues 50 (52; we use the HCoV- 
229E numbering scheme here, with numbers for 
SARS-CoV in parentheses) and 88(92), and one 
interacting with residue 46(48; Lys46 in 229E, His48 
in SARS-CoV). By superimposition with the struc- 
ture of the wild-type HCoV-229E Nsp9, which was 
crystallized from sulfate, two further sulfate-bind- 
ing sites are revealed. One is also near Lys50(52), but 
in a different position, and the other interacts with 
Lys82(86). The resulting five independent sulfate 
positions were used to define a path for ssRNA on 
the surface of the monomer and, subsequently, the 
polymer. We also note that the residues that we 
propose to interact with the ssRNA on the basis of 
this model (Lys10, Lys50, Tyr51, Arg70, Tyr83, 
Lys88, and Argl107) are better conserved, on 
average, than the polypeptide sequence. In our 
crude model, the ssRNA forms a left-handed helix 
wrapping around the Nsp9 polymer, similar to the 
model proposed recently for the nucleocapsid 
protein of SARS-CoV interacting with ssRNA.*! In 
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Fig. 7. Oligomers of SARS-CoV Nsp9 (PDB codes 1QZ8 (a) and 1UW7 (b)) and the HCoV-229E Nsp9 Cys69Ala mutant 
(c). Independent of space group symmetry (1QZ8, P6)22; LUW7, P4322; and HCoV-229E Nsp9 Cys69Ala mutant, P2;2)21), 
two common dimer interfaces are present in these crystal structures of Nsp9. One interface is formed mainly by the C- 
terminal a-helix (red), and the other by strand £5 (blue). We propose that the ssRNA (black) could wrap around the Nsp9 
polymer by forming a left-handed helix (a), with approximately 40 nucleotides bound per Nsp9 dimer. 
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our model, approximately 40 nucleotides can be 
bound per Nsp9 dimer within the extended polymer 
that we propose for SARS-CoV Nsp9 (Fig. 7a). 

Nsp9 interaction with nucleic acids was visualized 
using a slightly modified version of zone inter- 
ference gel electrophoresis (ZIGE; for details, see 
Materials and Methods).** HCoV-229E and SARS- 
CoV Nsp9 bind to single-stranded and, to a limited 
extent, double-stranded deoxyoligonucleotides 
without any sequence specificity (Deoxyribonucleo- 
tides were used instead of ribonucleotides because 
they showed identical behavior in test runs). In 
agreement with the structural data discussed above, 
surface-exposed positively charged residues could 
interact with the sugar-phosphate backbone of the 
nucleic acid. However, there is a strong correlation 
between the length of the oligonucleotide and the 
gel mobility-shift observed (Fig. 5a and b). Although 
the dimerization modes of HCoV-229E and SARS- 
CoV Nsp9 wild-type proteins are very different, 
their binding profile to nucleic acid in the gel 
mobility-shift experiment is similar (neglecting the 
stepwise rather than linear increase of the gel shift 
with oligonucleotide lengths in case of the SARS- 
CoV protein). On the other hand, the HCoV-229E 
Cys69Ala mutant has a dimerization mode similar 
to that of the wild-type SARS-CoV Nsp9 but it does 
not show binding with the nucleic acid in the gel 
mobility-shift experiment (except for the small shift 
seen for the 55-mer, the longest oligonucleotide 
tested). This inability to bind oligonucleotides could, 
in principle, be due to a direct interaction of the 
Cys69 with the nucleic acid. Therefore, we prepared 
an additional three mutants; namely, HCoV-229E 
Cys69Ser and the corresponding mutants of SARS- 
CoV Nsp9, Cys73Ala and Cys73Ser. Similar to 
HCoV-229E Cys69Ala, the Cys69Ser mutant did not 
exhibit a significant shift in the ZIGE experiment. 
However, the homologous SARS-CoV Cys73Ala 
and Cys73Ser mutants did show a shift of the 
same magnitude as the SARS-CoV Nsp9 wild-type 
(Fig. 5c). It remains to be understood why the 
HCoV-229E Cys69Ala and Cys69Ser mutants appar- 
ently do not bind nucleic acids, whereas the corres- 
ponding SARS-CoV mutants do. 

SPR was used to further analyze the interaction 
between nucleic acids and the different dimeric 
forms of HCoV-229E Nsp9. An “aged” preparation 
(two weeks old) of wild-type HCoV-229E Nsp9 
showed a strong signal for the binding with a 50- 
mer oligonucleotide under non-reducing conditions. 
Freshly prepared wild-type HCoV-229E Nsp9 gave a 
much weaker binding signal under reducing condi- 
tions. In the case of the HCoV-229E Nsp9 Cys69Ala 
mutant, the signal was equally small, irrespective of 
whether DTT was present. These relatively weak 
signals still indicate significant binding to the 50-mer, 
albeit much weaker than that found for the wild-type 
HCoV-229E protein in its oxidized state. This is in 
agreement with the gel mobility-shifts, where we had 
observed a weak shift for the HCoV-229E mutants in 
the case of the longest oligonucleotide examined 
(the 55-mer), but not with shorter oligonucleotides. 


Why does the wild-type HCoV-229E Nsp9, but not 
the SARS-CoV protein, form a disulfide-linked 
homodimer even though it has Cys69(73) con- 
served? The chemical environment of the cysteine 
residues are the same in the two structures, i.e. there 
is no special structural feature in HCoV-229E Nsp9 
that would cause a particular reactivity of Cys69. 
However, in contrast to HCoV-229E, the SARS-CoV 
protein has two additional cysteine residues at 
positions 14 and 23. All three cysteines are in the 
free form, as we could determine using Ellman’s 
reagent (data not shown). If one of these cysteines 
was involved in an intermolecular disulfide bond, 
the latter would probably be reduced by the 
remaining ones, so that a disulfide-bonded dimer 
would be unlikely to be the dominant species. In 
agreement with this argument, there are few 
proteins that contain a disulfide bond in addition 
to a free cysteine.** (An exception is the cysteine 
proteases of the papain family, where the active-site 
cysteine has special properties, such as an unusual 
pK, value). 

The question remains of whether the disulfide- 
bonded form of wild-type HCoV-229E Nsp9 is an 
artifact that may have occurred during protein 
preparation. As mentioned before, we observed 
that a freshly prepared sample of this protein gave a 
reaction with Ellman’s reagent, but not so after one 
day. Apparently, there is a correlation between the 
age of the sample and disulfide formation, even 
though the reducing agent (5 mM DTT) was added 
at regular intervals. It is known that DTT is subject to 
oxidation itself; its half-life at 20 °C is 10h and 40h 
at pH 7.5 and 6.5, respectively.** We determined the 
concentration of DTT required to fully reduce the 
Nsp9 disulfide as 10 mM; however, the protein 
would not crystallize at this concentration. 

As the disulfide-bonded form of HCoV-229E Nsp9 
binds oligonucleotides with much higher affinity 
than the reduced form, we believe that it may indeed 
have a biological role, possibly in response to the 
oxidative stress induced by the viral infection of the 
host cell. There are several earlier reports suggesting 
the regulation of DNA/RNA-binding proteins by 
redox processes. Thus, the redox state has been 
shown to determine the interaction with DNA of the 
multifunctional eukaryotic SSB protein, RPA.** 
Also, many transcription factors including Fos, 
Jun, NF-KB, PaX, FNR, OxyR are regulated by the 
redox state of the environment.” Another example 
is the p53 tumor suppressor protein, which binds to 
DNA with sequence specificity only in the reduced 
state. Disulfide formation in the oxidized state alters 
the conformation and the protein binds DNA 
without any sequence specificity.*° 

Several viruses have been reported to induce 
oxidative stress in the infected cell. Among the 
coronavirus family, TGEV was shown to induce 
apoptosis in the infected cell via oxidative stress.* 
Similarly, rhinovirus’ and baculovirus*® induce 
oxidative stress in the infected cell. More specifically, 
LEF3 (the SSB of baculovirus) shows a DNA- 
annealing effect in its oxidized state, whereas in the 
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reduced state, its DNA-unwinding activity is 
favored. Cys214 is apparently involved in DNA 
binding; when mutated to serine, both DNA-binding 
and unwinding activities are reduced.”’ It has been 
hypothesized that this cysteine could form an 
intermolecular disulfide bridge and thus results in 
LEF3 oligomers in solution. This could allow more 
DNA to bind in closer proximity, thus favoring the 
annealing effect.*” The E2 protein of bovine papil- 
loma virus type 1 and ICP8 of herpes simplex virus 
type 1 also show DNA-binding activity depending 
on the redox state of the environment.°°-? 

What is the relevance of these in vitro studies to the 
situation in the infected host cell? For several RNA 
viruses, including mouse hepatitis (corona)virus 
(MHV) and SARS-CoV, it has been shown that 
viral replication is localized to double-membrane 
vesicles that have been hijacked from the endoplas- 
mic reticulum or late endosomes.”* ~’ These double- 
membrane vesicles are 200-350 nm in diameter and 
present in the cytosol alone or as clusters.” The 
milieu inside these vesicles or at their surface is 
unknown, but it is quite possible that it is partially 
oxidative. Since it is here that replicase proteins are 
produced at high levels, it is conceivable that the 
oxidized form of HCoV-229E Nsp9 is the dominant 
species. According to our findings, this form binds 
to ssRNA more tightly than does the reduced form, 
and could therefore promote replication of the viral 
genome. This speculation is supported by the recent 
report by Wuet al. °° who have shown that oxidative 
stress in the host cell promotes HCoV-229E infectiv- 
ity. Very likely, SARS-CoV will also induce oxidative 
stress in the infected host cell,°? even though its 
Nsp9 does not seem to form disulfide-linked dimers, 
at least not in vitro. The replicase of this virus may 
have other mechanisms to deal with an oxidative 
environment. We note that the number of cysteine 
residues is above average in coronavirus replicase 
proteins; in SARS-CoV, their share is 3.9% (HCoV 
229E, 3.3%), whereas only 1.25% of residues in 
human cytosolic proteins are cysteine.°” In the 
SARS-CoV main proteinase (MP"°) alone, there are 
12 cysteine residues (3.9% of all residues), whereas 
this number is 8 (2.6%) in HCoV-229E MP"°. Some of 
these could perhaps scavenge oxygen radicals, 
thereby undergoing oxidation to sulfenic, sulfinic, 
or even sulfonic acid. 


Materials and Methods 


Cloning, expression and purification 


The regions coding for Nsp9 of HCoV 229E and SARS- 
CoV were amplified by PCR from virus-derived cDNA 
fragments. The HCoV-229E nsp9 PCR product was cloned 
into the pET15b vector, resulting in pPETHCoV-229E/nsp9, 
which contained the full-length gene (coding for ppla 
residues 3825-3933) with an N-terminal extension 
(MHHHHHHVKLO), including a Hisg tag for protein 
purification and a SARS-CoV main proteinase (MP"°) 
cleavage site (VKLQ‘NNE...) for tag removal. The same 


approach was chosen for SARS-CoV Nsp9, pp1a residues 
4118-4230, resulting in construct pETSARS-CoV/nsp9. 
Four mutants were prepared using single-site mutagen- 
esis. The corresponding codon of the amino acid to be 
mutated was encoded in the primer. The pETHCoV-229E/ 
nsp9 or pETSARS-CoV/nsp9 plasmids were used as 
template in the PCR reaction. The PCR product corre- 
sponding to a size of ~6.05 kb was purified and the 
template was removed using the restriction enzyme Dpnl. 
The restricted product was transformed into E. coli XL1- 
blue supercompetent cells. The correctness of the muta- 
tions (Cys69—Ala and Cys69—Ser for HCoV 229E, and 
Cys73—Ala, Cys73—Ser for SARS-CoV) was confirmed 
by DNA sequencing. 

The wild-type Nsp9 of HCoV 229E and of SARS-CoV and 
the mutant proteins were produced recombinantly and 
purified in a similar manner. Nsp9-encoding plasmids were 
transformed in the competent E. coli Tuner (DE3) pLacl 
strain (Merck KGaA, Darmstadt, Germany). Cultures were 
grown in TY medium at 37 °C until cells reached an 
absorbance of 0.4 at 660 nm. Cells were then induced with 
1 mM IPTG and grown for a further 4 h at 37 °C. The cells 
were then harvested by centrifugation at 7200g for 30 min at 
4 °C. The resulting pellets were frozen at —20 °C. For lysis, 
the cell pellets were resuspended in 20 mM Tris-HCl, 
300 mM NaCl, 5 mM DTT, and 20 mM imidazole, pH 7.5 
(25 °C). Cells were broken by French press after adding 
glycerol to 10% (v/v) and one Complete EDTA-free protease 
inhibitor cocktail tablet (Roche Diagnostics, Mannheim, 
Germany). To optimize the solubility of overproduced 
Nsp9, a sparse-matrix screen of buffer composition was 
applied.*! The sample was centrifuged at 150,000g for 1 hat 
4 °C. The supernatant was applied to a His Trap HP column 
(1 ml, GE Healthcare, Freiburg, Germany) with a flow rate of 
1 ml/min. After washing with 20 mM Tris-HCl, 300 mM 
NaCl, 5 mM DTT, and 20 mM imidazole, pH 7.5 (25 °C), the 
protein was eluted with a linear gradient of 20 mM-500 mM 
imidazole. For SARS-CoV Nsp9 purification alone, the 
buffers were adjusted to pH 8.0 (25 °C) and 8 11 of Benzonase 
(Merck KGaA, Darmstadt, Germany) was added after 
breaking the cells for 15 min at 37 °C, in order to hydrolyze 
contaminant E. coli nucleic acids. Protein was blotted and 
detected with anti-tetrahistidine antibody (Dianova, Ham- 
burg, Germany) and anti-mouse IgG-alkaline phosphatase 
conjugate (Sigma, Munich, Germany). 

Before cleaving the N-terminal His, tag, proteins were 
dialysed against 20 mM Tris, 200 mM NaCl, pH 7.5 
(25 °C). SARS-CoV MP"® carrying a C-terminal His, tag 
was added to a molar ratio of 1:100 and cleavage was 
allowed to continue for 16 h at 37 °C in the presence of 
5 mM DTT. The protein solution was applied to a His Trap 
HP column (1 ml, GE Healthcare, Freiburg, Germany) 
with a flow rate of 1 ml/min. His, tag-cleaved Nsp9 
passed through the column whereas uncleaved Nsp9 and 
MP'° were bound to it. 

SDS gel electrophoresis (without B-mercaptoethanol) 
was used to determine the minimum concentration of DTT 
required to reduce the disulfide bond. DTT solutions were 
freshly prepared and different concentrations of DTT 
(increasing in 1 mM steps) were added to the Nsp9 
preparation. After incubation for 30 min at room 
temperature, 2% loading buffer was added and SDS- 
PAGE was started. 


Crystallization, structure determination, and 
refinement 


Crystallization screening was performed for both the 
HCoV-229E Nsp9 wild-type protein and the Cys69Ala 
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mutant using a Phoenix robot (Dunn Labortechnik, 
Thelenberg, Germany) with the sitting-drop, vapor-diffu- 
sion method. Initial hits were optimized by manually 
setting up 2 ul hanging drops consisting of 1 j1l of protein 
solution and 1 yl of reservoir solution. Both wild-type and 
mutant protein crystals appeared within two days at 10 °C 
and a protein concentration of 6-8 mg/ml, in 20 mM Tris, 
200 mM NaCl, 1 mM DTT, pH 7.5. The most useful 
precipitants were ammonium sulfate at pH 4.0-4.5 for 
wild-type Nsp9, and polyethylene glycol monomethyl 
ether 2000 (pH 4.6) for the mutant (see Results for 
optimized conditions). X-ray diffraction data were col- 
lected at 100 K using synchrotron radiation of wavelength 
0.8075 A, provided by the University of Hamburg— 
University of Liibeck - EMBL beamline X13 at Deutsches 
Elektronen-Synchrotron (DESY), Hamburg. The cryopro- 
tectant was 25% glycerol for the wild-type protein and 
25% polyethylene glycol 400 for the mutant. Data were 
processed with DENZO™ and scaled using SCALA.” The 
structures were solved by using the molecular replace- 
ment program Phaser,o* with a monomer of SARS 
coronavirus Nsp9 (1QZ8) as the search model. The initial 
solutions had R-factors of 51.0% (wild-type) and 50.8% 
(mutant). Models were built into electron density using 
COOT,* and refined by REFMAC5.” The final model for 
wild-type HCoV-229E Nsp9 includes 70 water molecules, 
two sulfate ion sites, and one MPD molecule. The 
structural model for the mutant contains 90 water 
molecules and one DIT molecule. The final steps of 
refinement of the mutant structure incorporated TLS 
refinement with residues 4 — 109 of monomer A as one 
group and residues 6 — 106 of monomer B as the second 
group. The overall geometric quality of the models was 
checked using PROCHECK.”° The surface area buried 
upon dimerization was calculated using AREATMOL.” 
Structure superimposition and calculation of r.m.s. devia- 
tions were carried out using ALIGN.” The figures were 
created using PyMOL.®* 


Dynamic light-scattering 


Measurements were taken using a Spectroscatter 201 
(RiINA GmbH, Berlin, Germany) using 10 pl solution of 
Nsp9 in 20 mM Tris-HCl, 200 mM NaCl, 1 mM DTT, pH 
7.0 (25 °C). All measurements were done at a protein 
concentration of ~10 mg/ml. The results were analysed 
using the software provided by the manufacturer. Experi- 
mental errors were estimated as standard deviations 
calculated from ten measurements for each sample. 


Electrophoretic mobility-shift assay 


ZIGE is a method designed to analyze weak protein/ 
nucleic-acid interactions.'°** Because of the high iso- 
electric point of Nsp9, this method was modified slightly. 
HCoV-229E Nsp9 and SARS-CoV Nsp9 have a theoretical 
pl of 9.3 and 9.1, respectively; thus, the protein is 
positively charged at the pH of the running buffer (8.3), 
whereas the nucleic acids are negatively charged and will 
move towards the opposite pole during electrophoresis. 
Therefore, only the oligonucleotides (100 4M) were loaded 
into the sample slots and run for 40 min at 100 mA and 
4 °C. Subsequently, 50 1M protein was loaded into the 
respective lanes and the run was continued for another 
40 min, with the poles of the electrodes interchanged. With 
this design of the experiment, the protein and the nucleic 
acid run in opposite directions and meet in the middle of 


the agarose gel. If there is interaction between the two, the 
protein will move in the same direction as the nucleic acid, 
whereas if there is no interaction, the protein will simply 
pass the nucleic acid. Gel mobility-shift assays were 
performed using a horizontal 1% agarose gel system at 
pH 8.3 in TBE buffer (20 mM Tris, 50 mM boric acid, 
0.1 mM NaEDTA, pH 8.3 (25 °C) adjusted by the addition 
of acetic acid). The protein and DNA samples were mixed 
with dimethylsulfoxide (DMSO) at a final concentration of 
10% (v/v) and a trace of bromophenol blue (BPB) dye. 
After electrophoresis, the gel was fixed in 3.5% (w/v) a- 
sulfosalicilic acid, 10% (w/v) trichloroacetic acid, until the 
dye turned yellow. At this stage, zones with higher 
concentrations of ligands such as oligonucleotide can be 
seen as a whitish area. For the detection of protein bands, 
the gel was washed for 15 min in 15% (v/v) ethanol, 8% 
(v/v) acetic acid, and stained for 30 min with 0.25% (w/v) 
Coomassie brilliant blue in the same solution with 
additional 10% (v/v) methanol. The gel was washed in 
15% (v/v) ethanol, 8% (v/v) acetic acid, and stored in 10% 
(v/v) acetic acid. 


Crosslinking 


Chemical cross-linking experiments were performed for 
wild-type HCoV-229E Nsp9, the Cys69Ala mutant, and 
SARS-CoV Nsp9 with glutaraldehyde. Different concen- 
trations of Nsp9 were incubated in a final volume of 20 pl 
of cross-linking buffer (50 mM NaH>PO, (pH 7.6), 50 mM 
NaCl) for 5 min at 20 °C. After addition of 0.01% (w/v) 
glutaraldehyde, the reaction tubes were incubated for 
5 min at 20 °C, before the reactions were stopped by the 
addition of 100 mM Tris. In the case of the Nsp9-DNA 
complex, oligonucleotides and proteins were incubated 
for 2 h at 20 °C before glutaraldehyde crosslinking. The 
samples were analyzed by SDS-PAGE after adding 10 11 of 
2% sample-loading buffer and incubation at 70 °C for 
10 min. 


Surface plasmon resonance analysis 


The streptavidin-coated sensor chip (SA chip, Biacore, 
GE Healthcare, Freiburg, Germany) was preconditioned 
before immobilization, by treating it three times with 1M 
NaCl in 50 mM NaOH, 0.05% (w/v) SDS for 60 s at a flow 
rate of 5 pl/min. 5’-Biotinylated 50-mer oligonucleotide 
was purchased from MWG-Biotech AG, Ebersberg, 
Germany. For immobilization, 2 nM oligonucleotide was 
injected in manual mode until the desired amount of DNA 
on the surface was reached. To reduce non-specific 
binding of protein to the surface of the SA chip, the chip 
was activated with 50 mM N-hydroxysuccinimide and 
200 mM 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide 
(NHS/EDC) and deactivated five times with 1 M ethanol- 
amine. Saturation of residual free streptavidin-binding 
sites was achieved with 0.4% biotin. Unless mentioned 
specifically, all the injections were done at a flow rate of 
10 pl/min in the running buffer (10 mM Hepes pH 7.4, 
150 mM NaCl (HBS-N)). For determining the Kp value, the 
oligonucleotide was immobilized up to 300 RU. (For the 
set of experiments analyzing the interactions between 
oligonucleotide and the reduced or oxidized forms of 
HCoV-229E Nsp9, this value was 88 RU). HCoV-229E 
Nsp9 wild-type and SARS-CoV Nsp9 were injected into 
the flow cell for 22 min and 30 min, respectively. The 
concentration used for determining the Kp value ranged 
from 2 1M to 85 4M. The Kp values were calculated using 
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a steady-state equilibrium analysis in a 1:1 Langmuir 
single-state binding model or a two-state binding model. 
The single-state binding model is: 


(Ka x C Xx Rmnax) 


Rea = (KK, x Cxnd1) 


The two-state binding model is: 


_ (Kai xC  Rimax) j (Kao x Cx Romax) 
“1 (Kar X Cxn+1)° (Kap x Cxn+1) 


R 


Where Ka is the equilibrium association constant, C is the 
concentration of injected protein, Ryax is the maximum 
analyte-binding capacity and n is the steric interference 
factor. 

Regeneration was achieved using 0.05% SDS for 60 s. 
The efficiency of the regenerated SA chip was tested with 
multiple injections after the regeneration. The data were 
analyzed with the BIAevaluation software, version 3.2. 


Protein Data Bank accession codes 


Atomic coordinates of wild-type HCoV-229E Nsp9 and 
the Cys69Ala mutant have been deposited with the RCSB 
Protein Data Bank, with accession code 2J97 and 2J98, 
respectively. 
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